Overview

Dataset statistics

Number of variables: 19
Number of observations: 4500
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.9 MiB
Average record size in memory: 914.1 B
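
As a quick consistency check, the total size in memory should be roughly the average record size multiplied by the number of observations. A minimal sketch, assuming MiB is the binary unit (1,048,576 bytes); the variable names are illustrative only:

```python
# Consistency check for the memory figures in the dataset statistics.
N_OBSERVATIONS = 4500
AVG_RECORD_BYTES = 914.1       # reported average record size
BYTES_PER_MIB = 1024 ** 2      # assumption: MiB is the binary unit

total_mib = N_OBSERVATIONS * AVG_RECORD_BYTES / BYTES_PER_MIB
print(f"{total_mib:.1f} MiB")
```

The result rounds to the reported 3.9 MiB, so the two figures agree.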

Variable types

Text: 6
Numeric: 3
DateTime: 1
Categorical: 9

Alerts

Cluster is highly overall correlated with PatientIncome (High correlation)
PatientIncome is highly overall correlated with Cluster (High correlation)
ClaimLegitimacy is highly imbalanced (67.3%) (Imbalance)
ClaimID has unique values (Unique)
PatientID has unique values (Unique)
ProviderID has unique values (Unique)
PatientIncome has unique values (Unique)
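
The imbalance alert reports a score between 0% (perfectly balanced classes) and 100% (a single class). One common way to define such a score is 1 minus the normalized Shannon entropy of the class frequencies. The sketch below assumes that definition, which may differ in detail from what ydata-profiling computes, and it uses hypothetical class counts because the ClaimLegitimacy counts are not shown in this section.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class shares."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)

# Hypothetical counts, for illustration only (not taken from the report).
balanced = imbalance_score([2250, 2250])
skewed = imbalance_score([4230, 270])
print(f"balanced: {balanced:.1%}, skewed: {skewed:.1%}")
```

Under this assumed definition, a two-class split of roughly 94% to 6% scores close to the 67% level flagged in the alert.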

Reproduction

Analysis started: 2026-01-11 18:21:54.022805
Analysis finished: 2026-01-11 18:21:56.660254
Duration: 2.64 seconds
Software version: ydata-profiling v4.18.0
Download configuration: config.json
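
The reported duration can be checked from the two timestamps. A small sketch using only the standard library:

```python
from datetime import datetime

started = datetime.fromisoformat("2026-01-11 18:21:54.022805")
finished = datetime.fromisoformat("2026-01-11 18:21:56.660254")

# Elapsed wall-clock time between start and finish of the analysis.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```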

Variables

ClaimID
Text

Unique 

Distinct: 4500
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 373.7 KiB

Length

Max length: 36
Median length: 36
Mean length: 36
Min length: 36

Characters and Unicode

Total characters: 162000
Distinct characters: 17
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
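
To illustrate what such character properties look like, the sketch below uses Python's standard unicodedata module to tally the general category of each character in the first ClaimID sample value. This is only an illustration: the report itself lists a single (unknown) category, and how ydata-profiling derives its categories is not shown here.

```python
import unicodedata
from collections import Counter

sample = "4d76c7f7-d36a-4139-b451-a9a4ad10d7d5"  # first ClaimID sample row

# General category per character:
# Nd = decimal digit, Ll = lowercase letter, Pd = dash punctuation.
categories = Counter(unicodedata.category(ch) for ch in sample)
print(dict(categories))
```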

Unique

Unique: 4500
Unique (%): 100.0%

Sample

1st row: 4d76c7f7-d36a-4139-b451-a9a4ad10d7d5
2nd row: e35193b4-3609-492b-866a-98de19317e9c
3rd row: 1f3fa373-25ed-4ff4-b6c7-38dcb2fb297f
4th row: af6a68f4-8319-47b1-a28b-77de01572851
5th row: 417fe944-79d2-4610-81c4-a2d496f29ee4

Words

Value                                 Count  Frequency (%)
417fe944-79d2-4610-81c4-a2d496f29ee4  1      < 0.1%
291cfa64-9956-40e7-b89f-4628650f42f0  1      < 0.1%
4d76c7f7-d36a-4139-b451-a9a4ad10d7d5  1      < 0.1%
e35193b4-3609-492b-866a-98de19317e9c  1      < 0.1%
1492c9c7-e184-413d-b951-f4377400782f  1      < 0.1%
a1684758-40b1-4f1d-8f5b-409c7228dbac  1      < 0.1%
2c2ed3f4-90c6-4681-94b4-d20278d85963  1      < 0.1%
379d7c46-3096-4741-9d42-26f540347070  1      < 0.1%
ab6b425f-957e-4448-a715-97d8aabddb6d  1      < 0.1%
e1464b6a-4ea4-4fa1-952d-e16ebdd032c5  1      < 0.1%
Other values (4490)                   4490   99.8%
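
Every ClaimID is 36 characters long and drawn from 17 distinct characters, which is consistent with hyphenated hexadecimal UUID strings. The sketch below checks whether the five sample values parse as version-4 UUIDs in canonical lowercase form. That the whole column follows this format is an assumption; only the sample rows are tested, and the helper name is illustrative.

```python
import uuid

samples = [
    "4d76c7f7-d36a-4139-b451-a9a4ad10d7d5",
    "e35193b4-3609-492b-866a-98de19317e9c",
    "1f3fa373-25ed-4ff4-b6c7-38dcb2fb297f",
    "af6a68f4-8319-47b1-a28b-77de01572851",
    "417fe944-79d2-4610-81c4-a2d496f29ee4",
]

def is_uuid4(text):
    """True if text is a canonical, lowercase, hyphenated version-4 UUID."""
    try:
        parsed = uuid.UUID(text)
    except ValueError:
        return False
    return parsed.version == 4 and str(parsed) == text

print(all(is_uuid4(s) for s in samples))
```

A check like this could be run over the full column before treating the identifier as a join key.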

Most occurring characters

Value             Count  Frequency (%)
-                 18000  11.1%
4                 12975  8.0%
8                 9645   6.0%
a                 9608   5.9%
b                 9564   5.9%
9                 9537   5.9%
6                 8529   5.3%
f                 8511   5.3%
e                 8473   5.2%
2                 8464   5.2%
Other values (7)  58694  36.2%

Most occurring categories, scripts, and blocks

Value      Count   Frequency (%)
(unknown)  162000  100.0%

Every character is classified as (unknown) for category, script, and block, so the "Most frequent character per category", "per script", and "per block" tables are identical to the "Most occurring characters" table.

PatientID
Text

Unique 

Distinct: 4500
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 373.7 KiB

Length

Max length: 36
Median length: 36
Mean length: 36
Min length: 36

Characters and Unicode

Total characters: 162000
Distinct characters: 17
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 4500
Unique (%): 100.0%

Sample

1st row: 19cf2638-3ec0-4ed9-9995-d9ba4553813a
2nd row: 5c4bb6c5-4dd3-4a86-85fa-f36c0d8debff
3rd row: 777866e0-4d10-45a8-a7b4-dbdaa26d5a81
4th row: 9d7c53ee-eb1a-4f07-9e3a-e86cf82e9f0f
5th row: db14b0ca-ac2a-4e83-b085-947ea32e7587

Words

Value                                 Count  Frequency (%)
db14b0ca-ac2a-4e83-b085-947ea32e7587  1      < 0.1%
2bd2d173-4ce1-428d-836c-259d9236a839  1      < 0.1%
19cf2638-3ec0-4ed9-9995-d9ba4553813a  1      < 0.1%
5c4bb6c5-4dd3-4a86-85fa-f36c0d8debff  1      < 0.1%
fb07a807-4dcc-4e09-bea6-4ca54acf6add  1      < 0.1%
638c3542-dc16-4507-95f8-a1bb0c425624  1      < 0.1%
bce42931-4ff7-487b-b373-773bdb57241b  1      < 0.1%
09a26428-831a-4d5f-bd9f-ee790468aae5  1      < 0.1%
67f76baf-3c23-45ee-8898-ec4a25c85e11  1      < 0.1%
72f46521-1f31-4707-bbd9-4760af6d9d5c  1      < 0.1%
Other values (4490)                   4490   99.8%

Most occurring characters

Value             Count  Frequency (%)
-                 18000  11.1%
4                 12931  8.0%
8                 9664   6.0%
9                 9593   5.9%
a                 9583   5.9%
b                 9565   5.9%
d                 8596   5.3%
e                 8508   5.3%
5                 8484   5.2%
6                 8474   5.2%
Other values (7)  58602  36.2%

Most occurring categories, scripts, and blocks

Value      Count   Frequency (%)
(unknown)  162000  100.0%

Every character is classified as (unknown) for category, script, and block, so the "Most frequent character per category", "per script", and "per block" tables are identical to the "Most occurring characters" table.

ProviderID
Text

Unique 

Distinct: 4500
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 373.7 KiB

Length

Max length: 36
Median length: 36
Mean length: 36
Min length: 36

Characters and Unicode

Total characters: 162000
Distinct characters: 17
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 4500
Unique (%): 100.0%

Sample

1st row: a3d0cc80-dffe-40ff-a302-23c8ffeedb36
2nd row: a9f25acf-92b8-45e2-9cef-87bd07d0a591
3rd row: 951b1e08-9948-4956-80e5-9277f16bd290
4th row: de9e193a-f9a1-4d63-9345-aefe75694628
5th row: 5c7d7045-71b6-4c15-937c-43e4cfe65bf4

Words

Value                                 Count  Frequency (%)
5c7d7045-71b6-4c15-937c-43e4cfe65bf4  1      < 0.1%
cf84cf99-0ac3-465a-af90-239a873bafa5  1      < 0.1%
a3d0cc80-dffe-40ff-a302-23c8ffeedb36  1      < 0.1%
a9f25acf-92b8-45e2-9cef-87bd07d0a591  1      < 0.1%
4cbf206c-b046-40d6-b953-927d2ed77950  1      < 0.1%
20685b18-4e11-4a78-b714-1d5e70610385  1      < 0.1%
9e339ef9-c22f-4a42-b299-857fbbc1fa81  1      < 0.1%
4d2bab66-8b53-40b4-9724-c8c77522e8c3  1      < 0.1%
866b8799-1436-4c1b-ae34-37e1fda57c99  1      < 0.1%
e84f0876-375d-40f4-a112-46c24ac59627  1      < 0.1%
Other values (4490)                   4490   99.8%

Most occurring characters

Value             Count  Frequency (%)
-                 18000  11.1%
4                 12912  8.0%
8                 9676   6.0%
9                 9606   5.9%
a                 9544   5.9%
b                 9384   5.8%
f                 8602   5.3%
e                 8574   5.3%
1                 8547   5.3%
5                 8485   5.2%
Other values (7)  58670  36.2%

Most occurring categories, scripts, and blocks

Value      Count   Frequency (%)
(unknown)  162000  100.0%

Every character is classified as (unknown) for category, script, and block, so the "Most frequent character per category", "per script", and "per block" tables are identical to the "Most occurring characters" table.

ClaimAmount
Real number (ℝ)

Distinct: 4490
Distinct (%): 99.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5014.2039
Minimum: 100.12
Maximum: 9997.2
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 35.3 KiB

Quantile statistics

Minimum: 100.12
5-th percentile: 590.725
Q1: 2509.0725
Median: 5053.765
Q3: 7462.4525
95-th percentile: 9510.307
Maximum: 9997.2
Range: 9897.08
Interquartile range (IQR): 4953.38

Descriptive statistics

Standard deviation: 2866.2911
Coefficient of variation (CV): 0.57163433
Kurtosis: -1.2029995
Mean: 5014.2039
Median Absolute Deviation (MAD): 2483.075
Skewness: 0.00042447153
Sum: 22563917
Variance: 8215624.5
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
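
Several of the ClaimAmount statistics are derived from one another, so they can be cross-checked. The sketch below recomputes the coefficient of variation, the interquartile range, the range, and the mean from the reported figures; the variable names are illustrative, and small differences are expected because the inputs are rounded.

```python
import math

mean, std = 5014.2039, 2866.2911
q1, q3 = 2509.0725, 7462.4525
minimum, maximum = 100.12, 9997.2
total, n = 22563917, 4500

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
mean_from_sum = total / n        # the sum is shown rounded, so this is approximate

print(round(cv, 4), round(iqr, 2), round(value_range, 2), round(mean_from_sum, 2))
```

The near-zero skewness and the kurtosis close to -1.2 are also what a uniform distribution would produce, which suggests (but does not prove) that the amounts are spread roughly evenly between the minimum and maximum.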

Common values

Value                Count  Frequency (%)
7946.69              2      < 0.1%
9963.71              2      < 0.1%
8936.33              2      < 0.1%
6118.26              2      < 0.1%
6834.26              2      < 0.1%
6116.75              2      < 0.1%
8414.04              2      < 0.1%
4153.18              2      < 0.1%
862.1                2      < 0.1%
5540.34              2      < 0.1%
Other values (4480)  4480   99.6%

Minimum 10 values

Value   Count  Frequency (%)
100.12  1      < 0.1%
100.3   1      < 0.1%
101.33  1      < 0.1%
106.47  1      < 0.1%
111.01  1      < 0.1%
113.4   1      < 0.1%
114.59  1      < 0.1%
115.49  1      < 0.1%
119.72  1      < 0.1%
131.86  1      < 0.1%

Maximum 10 values

Value    Count  Frequency (%)
9997.2   1      < 0.1%
9995.62  1      < 0.1%
9994.2   1      < 0.1%
9989.04  1      < 0.1%
9983.64  1      < 0.1%
9979.55  1      < 0.1%
9978.43  1      < 0.1%
9977.72  1      < 0.1%
9976.52  1      < 0.1%
9972.66  1      < 0.1%

DateTime

Distinct: 731
Distinct (%): 16.2%
Missing: 0
Missing (%): 0.0%
Memory size: 35.3 KiB
Minimum: 2022-07-09 00:00:00
Maximum: 2024-07-08 00:00:00
Invalid dates: 0
Invalid dates (%): 0.0%
Histogram with fixed size bins (bins=50)
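
The number of distinct values can be compared with the number of calendar days between the minimum and maximum. A short sketch, assuming every value in this variable is a whole date at midnight (as the minimum and maximum are):

```python
from datetime import date

first, last = date(2022, 7, 9), date(2024, 7, 8)

# Count both endpoints, hence the +1.
calendar_days = (last - first).days + 1
print(calendar_days)
```

The count matches the 731 distinct values, so under that assumption every day in the two-year window appears at least once.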

Text

Distinct: 4495
Distinct (%): 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 237.4 KiB

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Characters and Unicode

Total characters: 22500
Distinct characters: 62
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 4490
Unique (%): 99.8%

Sample

1st row: Ta150
2nd row: Fo766
3rd row: AX876
4th row: SQ441
5th row: FK970

Words

Value                Count  Frequency (%)
me712                2      < 0.1%
zf797                2      < 0.1%
tt251                2      < 0.1%
rs522                2      < 0.1%
yl726                2      < 0.1%
ia775                2      < 0.1%
ej032                2      < 0.1%
xa248                2      < 0.1%
vy109                2      < 0.1%
ae034                2      < 0.1%
Other values (4476)  4480   99.6%
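
The five sample values of this variable all look like two letters followed by three digits, and the 62 distinct characters match 26 uppercase letters, 26 lowercase letters, and 10 digits. The sketch below tests the sample rows against that assumed pattern; the pattern is a hypothesis inferred from the samples, not something the report states.

```python
import re

# Assumed format: two ASCII letters followed by three digits.
CODE_PATTERN = re.compile(r"[A-Za-z]{2}[0-9]{3}")

samples = ["Ta150", "Fo766", "AX876", "SQ441", "FK970"]
print(all(CODE_PATTERN.fullmatch(s) for s in samples))
```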

Most occurring characters

Value              Count  Frequency (%)
4                  1388   6.2%
2                  1387   6.2%
7                  1383   6.1%
5                  1382   6.1%
9                  1364   6.1%
6                  1354   6.0%
3                  1339   6.0%
1                  1316   5.8%
8                  1300   5.8%
0                  1287   5.7%
Other values (52)  9000   40.0%

Most occurring categories, scripts, and blocks

Value      Count  Frequency (%)
(unknown)  22500  100.0%

Every character is classified as (unknown) for category, script, and block, so the "Most frequent character per category", "per script", and "per block" tables are identical to the "Most occurring characters" table.

Text

Distinct: 4495
Distinct (%): 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 237.4 KiB

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Characters and Unicode

Total characters: 22500
Distinct characters: 62
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 4491
Unique (%): 99.8%

Sample

1st row: iO013
2nd row: jR349
3rd row: uU479
4th row: Xs264
5th row: PV476

Words

Value                Count  Frequency (%)
zw098                3      0.1%
zf251                2      < 0.1%
ty099                2      < 0.1%
ze112                2      < 0.1%
yg753                2      < 0.1%
jw378                2      < 0.1%
ln407                2      < 0.1%
mt610                2      < 0.1%
nr774                2      < 0.1%
wq534                2      < 0.1%
Other values (4475)  4479   99.5%

Most occurring characters

Value              Count  Frequency (%)
5                  1419   6.3%
9                  1395   6.2%
1                  1385   6.2%
3                  1369   6.1%
7                  1368   6.1%
0                  1350   6.0%
6                  1344   6.0%
4                  1309   5.8%
8                  1296   5.8%
2                  1265   5.6%
Other values (52)  9000   40.0%

Most occurring categories, scripts, and blocks

Value      Count  Frequency (%)
(unknown)  22500  100.0%

Every character is classified as (unknown) for category, script, and block, so the "Most frequent character per category", "per script", and "per block" tables are identical to the "Most occurring characters" table.

PatientAge
Real number (ℝ)

Distinct: 100
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 49.838444
Minimum: 0
Maximum: 99
Zeros: 45
Zeros (%): 1.0%
Negative: 0
Negative (%): 0.0%
Memory size: 35.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 5
Q1: 25
Median: 50.5
Q3: 75
95-th percentile: 95
Maximum: 99
Range: 99
Interquartile range (IQR): 50

Descriptive statistics

Standard deviation: 28.790471
Coefficient of variation (CV): 0.57767595
Kurtosis: -1.2092364
Mean: 49.838444
Median Absolute Deviation (MAD): 25.5
Skewness: -0.02178574
Sum: 224273
Variance: 828.89121
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
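
PatientAge takes 100 distinct values from 0 to 99 with fairly even counts, so its statistics can be compared with those of a discrete uniform distribution on the integers 0 to 99. The sketch below computes the theoretical mean, standard deviation, and excess kurtosis for that reference distribution. Treating the ages as uniform is an assumption made for comparison, not a claim from the report.

```python
import math

n_values = 100  # integer ages 0..99, matching Distinct = 100

# Closed-form moments of a discrete uniform distribution on 0..n_values-1.
uniform_mean = (n_values - 1) / 2
uniform_std = math.sqrt((n_values ** 2 - 1) / 12)
uniform_excess_kurtosis = -6 * (n_values ** 2 + 1) / (5 * (n_values ** 2 - 1))

reported = {"mean": 49.838444, "std": 28.790471, "kurtosis": -1.2092364}
print(f"mean:     {uniform_mean:.3f} (uniform) vs {reported['mean']:.3f} (reported)")
print(f"std:      {uniform_std:.3f} (uniform) vs {reported['std']:.3f} (reported)")
print(f"kurtosis: {uniform_excess_kurtosis:.3f} (uniform) vs {reported['kurtosis']:.3f} (reported)")
```

The reported values sit close to the uniform reference, which is unusual for real patient ages and may indicate synthetic data.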

Common values

Value              Count  Frequency (%)
57                 64     1.4%
25                 59     1.3%
70                 58     1.3%
1                  57     1.3%
16                 56     1.2%
81                 56     1.2%
48                 55     1.2%
79                 55     1.2%
76                 54     1.2%
97                 54     1.2%
Other values (90)  3932   87.4%

Minimum 10 values

Value  Count  Frequency (%)
0      45     1.0%
1      57     1.3%
2      44     1.0%
3      39     0.9%
4      33     0.7%
5      38     0.8%
6      34     0.8%
7      33     0.7%
8      49     1.1%
9      51     1.1%

Maximum 10 values

Value  Count  Frequency (%)
99     46     1.0%
98     44     1.0%
97     54     1.2%
96     45     1.0%
95     40     0.9%
94     35     0.8%
93     40     0.9%
92     49     1.1%
91     45     1.0%
90     35     0.8%

PatientGender
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 219.9 KiB
F: 2282
M: 2218

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 4500
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: F
2nd row: M
3rd row: M
4th row: F
5th row: F

Common Values

Value  Count  Frequency (%)
F      2282   50.7%
M      2218   49.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value  Count  Frequency (%)
f      2282   50.7%
m      2218   49.3%

Most occurring characters

Value  Count  Frequency (%)
F      2282   50.7%
M      2218   49.3%

Most occurring categories, scripts, and blocks

Value      Count  Frequency (%)
(unknown)  4500   100.0%

Every character is classified as (unknown) for category, script, and block, so the "Most frequent character per category", "per script", and "per block" tables are identical to the "Most occurring characters" table.

ProviderSpecialty
Categorical

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 264.6 KiB
Pediatrics: 955
Cardiology: 907
Orthopedics: 893
General Practice: 880
Neurology: 865

Length

Max length: 16
Median length: 11
Mean length: 11.179556
Min length: 9

Characters and Unicode

Total characters: 50308
Distinct characters: 22
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Orthopedics
2nd row: Cardiology
3rd row: Cardiology
4th row: Cardiology
5th row: Neurology

Common Values

Value             Count  Frequency (%)
Pediatrics        955    21.2%
Cardiology        907    20.2%
Orthopedics       893    19.8%
General Practice  880    19.6%
Neurology         865    19.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value        Count  Frequency (%)
pediatrics   955    17.8%
cardiology   907    16.9%
orthopedics  893    16.6%
general      880    16.4%
practice     880    16.4%
neurology    865    16.1%

Most occurring characters

Value              Count  Frequency (%)
r                  5380   10.7%
e                  5353   10.6%
i                  4590   9.1%
o                  4437   8.8%
a                  3622   7.2%
c                  3608   7.2%
d                  2755   5.5%
t                  2728   5.4%
l                  2652   5.3%
s                  1848   3.7%
Other values (12)  13335  26.5%

Most occurring categories, scripts, and blocks

Value      Count  Frequency (%)
(unknown)  50308  100.0%

Every character is classified as (unknown) for category, script, and block, so the "Most frequent character per category", "per script", and "per block" tables are identical to the "Most occurring characters" table.

ClaimStatus
Categorical

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 246.2 KiB
Approved: 1522
Denied: 1512
Pending: 1466

Length

Max length: 8
Median length: 7
Mean length: 7.0022222
Min length: 6

Characters and Unicode

Total characters: 31510
Distinct characters: 12
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Pending
2nd row: Denied
3rd row: Pending
4th row: Pending
5th row: Approved

Common Values

Value     Count  Frequency (%)
Approved  1522   33.8%
Denied    1512   33.6%
Pending   1466   32.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value     Count  Frequency (%)
approved  1522   33.8%
denied    1512   33.6%
pending   1466   32.6%

Most occurring characters

Value             Count  Frequency (%)
e                 6012   19.1%
d                 4500   14.3%
n                 4444   14.1%
p                 3044   9.7%
i                 2978   9.5%
o                 1522   4.8%
A                 1522   4.8%
r                 1522   4.8%
v                 1522   4.8%
D                 1512   4.8%
Other values (2)  2932   9.3%

Most occurring categories, scripts, and blocks

Value      Count  Frequency (%)
(unknown)  31510  100.0%

Every character is classified as (unknown) for category, script, and block, so the "Most frequent character per category", "per script", and "per block" tables are identical to the "Most occurring characters" table.

PatientIncome
Real number (ℝ)

High correlation  Unique 

Distinct: 4500
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 84384.284
Minimum: 20006.87
Maximum: 149957.52
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 35.3 KiB

Quantile statistics

Minimum: 20006.87
5-th percentile: 26070.636
Q1: 52791.905
Median: 84061.205
Q3: 115768.42
95-th percentile: 142561.27
Maximum: 149957.52
Range: 129950.65
Interquartile range (IQR): 62976.513

Descriptive statistics

Standard deviation: 37085.909
Coefficient of variation (CV): 0.43948834
Kurtosis: -1.1707225
Mean: 84384.284
Median Absolute Deviation (MAD): 31383.245
Skewness: 0.015295257
Sum: 3.7972928 × 10^8
Variance: 1.3753646 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value                Count  Frequency (%)
131676.02            1      < 0.1%
57595.11             1      < 0.1%
140772.72            1      < 0.1%
69803.19             1      < 0.1%
138895.98            1      < 0.1%
96529.57             1      < 0.1%
28830.41             1      < 0.1%
111654.49            1      < 0.1%
20440.42             1      < 0.1%
131764.74            1      < 0.1%
Other values (4490)  4490   99.8%

Minimum 10 values

Value     Count  Frequency (%)
20006.87  1      < 0.1%
20031.31  1      < 0.1%
20031.58  1      < 0.1%
20053.34  1      < 0.1%
20093.19  1      < 0.1%
20102.64  1      < 0.1%
20117.76  1      < 0.1%
20122.6   1      < 0.1%
20166.98  1      < 0.1%
20278.32  1      < 0.1%

Maximum 10 values

Value      Count  Frequency (%)
149957.52  1      < 0.1%
149935.67  1      < 0.1%
149913.57  1      < 0.1%
149857.61  1      < 0.1%
149837.5   1      < 0.1%
149820.25  1      < 0.1%
149819.49  1      < 0.1%
149812.83  1      < 0.1%
149794.76  1      < 0.1%
149728.97  1      < 0.1%

Categorical

Distinct: 4
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 246.2 KiB
Married: 1181
Widowed: 1127
Divorced: 1101
Single: 1091

Length

Max length: 8
Median length: 7
Mean length: 7.0022222
Min length: 6

Characters and Unicode

Total characters: 31510
Distinct characters: 16
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Single
2nd row: Widowed
3rd row: Married
4th row: Married
5th row: Divorced

Common Values

Value     Count  Frequency (%)
Married   1181   26.2%
Widowed   1127   25.0%
Divorced  1101   24.5%
Single    1091   24.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value     Count  Frequency (%)
married   1181   26.2%
widowed   1127   25.0%
divorced  1101   24.5%
single    1091   24.2%

Most occurring characters

Value             Count  Frequency (%)
d                 4536   14.4%
i                 4500   14.3%
e                 4500   14.3%
r                 3463   11.0%
o                 2228   7.1%
a                 1181   3.7%
M                 1181   3.7%
W                 1127   3.6%
w                 1127   3.6%
D                 1101   3.5%
Other values (6)  6566   20.8%

Most occurring categories, scripts, and blocks

Value      Count  Frequency (%)
(unknown)  31510  100.0%

Every character is classified as (unknown) for category, script, and block, so the "Most frequent character per category", "per script", and "per block" tables are identical to the "Most occurring characters" table.

Categorical

Distinct: 4
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 250.7 KiB
Employed: 1188
Unemployed: 1141
Student: 1110
Retired: 1061

Length

Max length: 10
Median length: 8
Mean length: 8.0246667
Min length: 7

Characters and Unicode

Total characters: 36111
Distinct characters: 16
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Employed
2nd row: Employed
3rd row: Student
4th row: Employed
5th row: Unemployed

Common Values

Value       Count  Frequency (%)
Employed    1188   26.4%
Unemployed  1141   25.4%
Student     1110   24.7%
Retired     1061   23.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value       Count  Frequency (%)
employed    1188   26.4%
unemployed  1141   25.4%
student     1110   24.7%
retired     1061   23.6%

Most occurring characters

Value             Count  Frequency (%)
e                 6702   18.6%
d                 4500   12.5%
t                 3281   9.1%
m                 2329   6.4%
y                 2329   6.4%
l                 2329   6.4%
o                 2329   6.4%
p                 2329   6.4%
n                 2251   6.2%
E                 1188   3.3%
Other values (6)  6544   18.1%

Most occurring categories, scripts, and blocks

Value      Count  Frequency (%)
(unknown)  36111  100.0%

Every character is classified as (unknown) for category, script, and block, so the "Most frequent character per category", "per script", and "per block" tables are identical to the "Most occurring characters" table.

Text

Distinct: 3876
Distinct (%): 86.1%
Missing: 0
Missing (%): 0.0%
Memory size: 268.4 KiB

Length

Max length: 22
Median length: 19
Mean length: 12.056222
Min length: 6

Characters and Unicode

Total characters: 54253
Distinct characters: 50
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3417
Unique (%): 75.9%

Sample

1st row: New Alishaview
2nd row: East Curtis
3rd row: Lake Jennifer
4th row: Martinstad
5th row: Thomasfurt

Words

Value                Count  Frequency (%)
east                 336    5.0%
north                333    4.9%
south                330    4.9%
lake                 324    4.8%
west                 317    4.7%
port                 304    4.5%
new                  298    4.4%
michael              32     0.5%
jennifer             20     0.3%
james                20     0.3%
Other values (3077)  4428   65.7%

Most occurring characters

Value              Count  Frequency (%)
e                  5232   9.6%
t                  4293   7.9%
a                  4276   7.9%
r                  4221   7.8%
o                  3682   6.8%
h                  3002   5.5%
n                  2964   5.5%
i                  2682   4.9%
s                  2615   4.8%
(space)            2242   4.1%
Other values (40)  19044  35.1%

Most occurring categories, scripts, and blocks

Value      Count  Frequency (%)
(unknown)  54253  100.0%

Every character is classified as (unknown) for category, script, and block, so the "Most frequent character per category", "per script", and "per block" tables are identical to the "Most occurring characters" table.

ClaimType
Categorical

Distinct: 4
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 253.9 KiB
Outpatient: 1152
Routine: 1149
Inpatient: 1128
Emergency: 1071

Length

Max length: 10
Median length: 9
Mean length: 8.7453333
Min length: 7

Characters and Unicode

Total characters: 39354
Distinct characters: 17
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Inpatient
2nd row: Inpatient
3rd row: Emergency
4th row: Routine
5th row: Inpatient

Common Values

Value       Count  Frequency (%)
Outpatient  1152   25.6%
Routine     1149   25.5%
Inpatient   1128   25.1%
Emergency   1071   23.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value       Count  Frequency (%)
outpatient  1152   25.6%
routine     1149   25.5%
inpatient   1128   25.1%
emergency   1071   23.8%

Most occurring characters

Value             Count  Frequency (%)
t                 6861   17.4%
n                 5628   14.3%
e                 5571   14.2%
i                 3429   8.7%
u                 2301   5.8%
a                 2280   5.8%
p                 2280   5.8%
O                 1152   2.9%
R                 1149   2.9%
o                 1149   2.9%
Other values (7)  7554   19.2%

Most occurring categories, scripts, and blocks

Value      Count  Frequency (%)
(unknown)  39354  100.0%

Every character is classified as (unknown) for category, script, and block, so the "Most frequent character per category", "per script", and "per block" tables are identical to the "Most occurring characters" table.

Categorical

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 238.9 KiB
Paper: 1544
Phone: 1495
Online: 1461

Length

Max length: 6
Median length: 5
Mean length: 5.3246667
Min length: 5

Characters and Unicode

Total characters: 23961
Distinct characters: 11
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Paper
2nd row: Online
3rd row: Online
4th row: Phone
5th row: Phone

Common Values

Value | Count | Frequency (%)
Paper | 1544 | 34.3%
Phone | 1495 | 33.2%
Online | 1461 | 32.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
paper | 1544 | 34.3%
phone | 1495 | 33.2%
online | 1461 | 32.5%

Most occurring characters

Value | Count | Frequency (%)
e | 4500 | 18.8%
n | 4417 | 18.4%
P | 3039 | 12.7%
p | 1544 | 6.4%
a | 1544 | 6.4%
r | 1544 | 6.4%
h | 1495 | 6.2%
o | 1495 | 6.2%
O | 1461 | 6.1%
l | 1461 | 6.1%
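
The character counts in this table are consistent with counting every character of every value, case-sensitively, weighted by how often the value occurs. A small sketch, assuming the three category counts reported for this variable; `char_counts` is an illustrative name and not part of the report:

```python
from collections import Counter

# Category counts as reported for this variable.
counts = {"Paper": 1544, "Phone": 1495, "Online": 1461}

char_counts = Counter()
for value, n in counts.items():
    for ch in value:
        # Case-sensitive: 'P' and 'p' are tallied separately.
        char_counts[ch] += n

total = sum(char_counts.values())
for ch, n in char_counts.most_common(3):
    print(ch, n, f"{n / total:.1%}")
```

The same tally yields 11 distinct characters, which matches the figure under Characters and Unicode.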

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 23961 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 23961 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 23961 | 100.0%

Cluster
Categorical

High correlation 

Distinct 4
Distinct (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory size 219.9 KiB

3 | 1152
0 | 1144
2 | 1104
1 | 1100

Length

Max length 1
Median length 1
Mean length 1
Min length 1

Characters and Unicode

Total characters 4500
Distinct characters 4
Distinct categories 1
Distinct scripts 1
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
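
As an illustration of the character properties mentioned above, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies categories for the four Cluster labels using the counts from this report; `category_counts` is a hypothetical helper, not something the report provides. Script and block lookups are left out, since `unicodedata` does not offer them.

```python
import unicodedata
from collections import Counter


def category_counts(counts):
    """Tally Unicode general categories over all characters of all values."""
    tally = Counter()
    for value, n in counts.items():
        for ch in value:
            tally[unicodedata.category(ch)] += n
    return tally


# Cluster label counts as reported for this variable.
cluster_counts = {"3": 1152, "0": 1144, "2": 1104, "1": 1100}
cluster_categories = category_counts(cluster_counts)
print(cluster_categories)
```

All four labels are decimal digits, so the tally contains the single category `Nd`, in line with the one distinct category reported above.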

Unique

Unique 0
Unique (%) 0.0%

Sample

1st row 3
2nd row 2
3rd row 3
4th row 2
5th row 1

Common Values

Value | Count | Frequency (%)
3 | 1152 | 25.6%
0 | 1144 | 25.4%
2 | 1104 | 24.5%
1 | 1100 | 24.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
3 | 1152 | 25.6%
0 | 1144 | 25.4%
2 | 1104 | 24.5%
1 | 1100 | 24.4%

Most occurring characters

Value | Count | Frequency (%)
3 | 1152 | 25.6%
0 | 1144 | 25.4%
2 | 1104 | 24.5%
1 | 1100 | 24.4%

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 4500 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 4500 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 4500 | 100.0%

ClaimLegitimacy
Categorical

Imbalance 

Distinct 2
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 258.1 KiB

Legitimate | 4230
Fraud | 270

Length

Max length 10
Median length 10
Mean length 9.7
Min length 5

Characters and Unicode

Total characters 43650
Distinct characters 11
Distinct categories 1
Distinct scripts 1
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 0
Unique (%) 0.0%

Sample

1st row Legitimate
2nd row Legitimate
3rd row Legitimate
4th row Legitimate
5th row Legitimate

Common Values

Value | Count | Frequency (%)
Legitimate | 4230 | 94.0%
Fraud | 270 | 6.0%
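
One way to arrive at the 67.3% imbalance figure flagged for this variable is to take one minus the Shannon entropy of the class shares, normalised by the maximum entropy for the number of classes. The sketch below applies that formula to the counts above. That the report computes its score exactly this way is an assumption, and `imbalance_score` is an illustrative name.

```python
from math import log2


def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Classes with zero observations contribute nothing to the entropy.
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1 - entropy / log2(k)


# Legitimate and Fraud counts as reported for this variable.
score = imbalance_score([4230, 270])
print(f"{score:.1%}")
```

A perfectly balanced variable scores 0 under this definition, and the score approaches 1 as one class dominates.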

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
legitimate | 4230 | 94.0%
fraud | 270 | 6.0%

Most occurring characters

Value | Count | Frequency (%)
e | 8460 | 19.4%
t | 8460 | 19.4%
i | 8460 | 19.4%
a | 4500 | 10.3%
L | 4230 | 9.7%
g | 4230 | 9.7%
m | 4230 | 9.7%
F | 270 | 0.6%
r | 270 | 0.6%
u | 270 | 0.6%

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 43650 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 43650 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 43650 | 100.0%

Interactions


Correlations

Variable | ClaimAmount | ClaimLegitimacy | ClaimStatus | ClaimSubmissionMethod | ClaimType | Cluster | PatientAge | PatientEmploymentStatus | PatientGender | PatientIncome | PatientMaritalStatus | ProviderSpecialty
ClaimAmount | 1.000 | 0.406 | 0.000 | 0.006 | 0.044 | 0.022 | 0.009 | 0.007 | 0.022 | 0.019 | 0.014 | 0.017
ClaimLegitimacy | 0.406 | 1.000 | 0.000 | 0.000 | 0.000 | 0.437 | 0.030 | 0.013 | 0.009 | 0.400 | 0.000 | 0.000
ClaimStatus | 0.000 | 0.000 | 1.000 | 0.014 | 0.000 | 0.005 | 0.016 | 0.013 | 0.000 | 0.000 | 0.018 | 0.024
ClaimSubmissionMethod | 0.006 | 0.000 | 0.014 | 1.000 | 0.009 | 0.000 | 0.011 | 0.000 | 0.000 | 0.026 | 0.023 | 0.018
ClaimType | 0.044 | 0.000 | 0.000 | 0.009 | 1.000 | 0.023 | 0.000 | 0.000 | 0.000 | 0.026 | 0.000 | 0.000
Cluster | 0.022 | 0.437 | 0.005 | 0.000 | 0.023 | 1.000 | 0.029 | 0.000 | 0.004 | 0.920 | 0.000 | 0.000
PatientAge | 0.009 | 0.030 | 0.016 | 0.011 | 0.000 | 0.029 | 1.000 | 0.000 | 0.000 | 0.017 | 0.000 | 0.014
PatientEmploymentStatus | 0.007 | 0.013 | 0.013 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000
PatientGender | 0.022 | 0.009 | 0.000 | 0.000 | 0.000 | 0.004 | 0.000 | 0.000 | 1.000 | 0.028 | 0.017 | 0.018
PatientIncome | 0.019 | 0.400 | 0.000 | 0.026 | 0.026 | 0.920 | 0.017 | 0.000 | 0.028 | 1.000 | 0.008 | 0.000
PatientMaritalStatus | 0.014 | 0.000 | 0.018 | 0.023 | 0.000 | 0.000 | 0.000 | 0.000 | 0.017 | 0.008 | 1.000 | 0.000
ProviderSpecialty | 0.017 | 0.000 | 0.024 | 0.018 | 0.000 | 0.000 | 0.014 | 0.000 | 0.018 | 0.000 | 0.000 | 1.000
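
The matrix above mixes categorical and numeric variables on a 0 to 1 scale. A common association measure for two categorical variables on that scale is Cramér's V, sketched below in plain Python. Whether the report used exactly this statistic (or a bias-corrected variant, or how it binned numeric columns such as PatientIncome) is an assumption, and the toy data is hypothetical rather than taken from the dataset.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    pair_counts = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    k = min(len(row_totals), len(col_totals))
    if n == 0 or k < 2:
        return 0.0
    # Pearson chi-squared over every cell of the contingency table, empty cells included.
    chi2 = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n
            observed = pair_counts.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * (k - 1)))


# Hypothetical toy data, not taken from the dataset.
cluster = ["0", "0", "1", "1", "2", "2"]
income_band = ["mid", "mid", "low", "low", "high", "high"]  # fully determined by cluster
gender = ["F", "M", "F", "M", "F", "M"]                     # unrelated to cluster

v_strong = cramers_v(cluster, income_band)
v_none = cramers_v(cluster, gender)
print(v_strong, v_none)
```

In the toy data a band that is fully determined by the cluster scores at the top of the scale and an unrelated variable scores 0, which is the kind of contrast the Cluster and PatientIncome entry (0.920) shows against the near-zero entries elsewhere in the matrix.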

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Row | ClaimID | PatientID | ProviderID | ClaimAmount | ClaimDate | DiagnosisCode | ProcedureCode | PatientAge | PatientGender | ProviderSpecialty | ClaimStatus | PatientIncome | PatientMaritalStatus | PatientEmploymentStatus | ProviderLocation | ClaimType | ClaimSubmissionMethod | Cluster | ClaimLegitimacy
0 | 4d76c7f7-d36a-4139-b451-a9a4ad10d7d5 | 19cf2638-3ec0-4ed9-9995-d9ba4553813a | a3d0cc80-dffe-40ff-a302-23c8ffeedb36 | 7820.52 | 2024-07-08 | Ta150 | iO013 | 96 | F | Orthopedics | Pending | 57595.11 | Single | Employed | New Alishaview | Inpatient | Paper | 3 | Legitimate
1 | e35193b4-3609-492b-866a-98de19317e9c | 5c4bb6c5-4dd3-4a86-85fa-f36c0d8debff | a9f25acf-92b8-45e2-9cef-87bd07d0a591 | 5453.86 | 2024-07-08 | Fo766 | jR349 | 95 | M | Cardiology | Denied | 140772.72 | Widowed | Employed | East Curtis | Inpatient | Online | 2 | Legitimate
2 | 1f3fa373-25ed-4ff4-b6c7-38dcb2fb297f | 777866e0-4d10-45a8-a7b4-dbdaa26d5a81 | 951b1e08-9948-4956-80e5-9277f16bd290 | 8229.86 | 2024-07-08 | AX876 | uU479 | 10 | M | Cardiology | Pending | 69803.19 | Married | Student | Lake Jennifer | Emergency | Online | 3 | Legitimate
3 | af6a68f4-8319-47b1-a28b-77de01572851 | 9d7c53ee-eb1a-4f07-9e3a-e86cf82e9f0f | de9e193a-f9a1-4d63-9345-aefe75694628 | 9519.16 | 2024-07-08 | SQ441 | Xs264 | 59 | F | Cardiology | Pending | 135530.12 | Married | Employed | Martinstad | Routine | Phone | 2 | Legitimate
4 | 417fe944-79d2-4610-81c4-a2d496f29ee4 | db14b0ca-ac2a-4e83-b085-947ea32e7587 | 5c7d7045-71b6-4c15-937c-43e4cfe65bf4 | 3226.15 | 2024-07-08 | FK970 | PV476 | 36 | F | Neurology | Approved | 36995.52 | Divorced | Unemployed | Thomasfurt | Inpatient | Phone | 1 | Legitimate
5 | 41c69c3f-7b63-435c-841f-97633264a347 | 9caba0e6-334d-4132-9330-1c1adaa82d11 | 11ff25ad-29c9-493b-a356-cb0c6a8f41a6 | 3476.56 | 2024-07-07 | ZE958 | Am159 | 26 | F | Cardiology | Denied | 96819.09 | Divorced | Retired | North Michael | Outpatient | Paper | 0 | Legitimate
6 | 80a92d69-9d51-476c-8d1d-0ea35a7081a9 | c4daf0c4-8d67-4aba-97db-442a948db4d3 | 19d62078-bb03-4473-8815-5f814c12b5c8 | 6468.55 | 2024-07-07 | hg131 | vm240 | 3 | M | Neurology | Denied | 117271.04 | Married | Employed | West Paul | Emergency | Paper | 2 | Legitimate
7 | 31c804c9-110c-4c26-bf38-2638b9e29526 | 71d7c4ac-c608-4392-8f71-83ce85d00595 | c1bfab96-0df6-4a49-96e1-236bd3c6a7b5 | 280.40 | 2024-07-07 | Xa559 | eD733 | 99 | M | Pediatrics | Denied | 125318.21 | Widowed | Employed | Ambermouth | Inpatient | Paper | 2 | Legitimate
8 | 25d801f8-d141-4131-9f1f-0c63360b4302 | 919f254e-a7eb-41da-8f14-b2ee11aad6da | 8d1a5376-5ea6-42e6-beec-2c1313a30a49 | 4661.71 | 2024-07-07 | Sj663 | uq058 | 57 | F | Neurology | Pending | 24263.98 | Widowed | Employed | Larsonville | Inpatient | Paper | 1 | Legitimate
9 | 8b5172a0-9aab-439a-9e3f-d0af5f2a1b6b | 6d707925-803e-42b8-af6e-e77d9f45dd8a | ed55392c-f9e3-469c-9367-00df827b1cf6 | 9638.64 | 2024-07-07 | Qu671 | Gw549 | 91 | M | Pediatrics | Approved | 78191.10 | Widowed | Unemployed | South Jessicabury | Outpatient | Phone | 3 | Legitimate

Row | ClaimID | PatientID | ProviderID | ClaimAmount | ClaimDate | DiagnosisCode | ProcedureCode | PatientAge | PatientGender | ProviderSpecialty | ClaimStatus | PatientIncome | PatientMaritalStatus | PatientEmploymentStatus | ProviderLocation | ClaimType | ClaimSubmissionMethod | Cluster | ClaimLegitimacy
4490 | a996469c-9d91-437d-9a13-33384ec86e27 | 6c35f381-e4fc-4d5b-a4e4-fc8abeed154a | 04a1cc5f-73cc-4bca-b6f0-af3fc2529ec4 | 5879.35 | 2022-07-10 | FC909 | jB020 | 84 | F | Cardiology | Denied | 129931.42 | Divorced | Unemployed | Jacobsberg | Outpatient | Paper | 2 | Legitimate
4491 | 6c85c4f1-bc4b-46fb-a573-3fea7853cb38 | 05ec1ef6-bef2-43af-befa-c00eb68af7a9 | 67cd32ac-9518-417a-aa8e-bc46911951e4 | 6250.80 | 2022-07-10 | aW066 | su896 | 91 | M | General Practice | Pending | 43191.77 | Married | Employed | Lake Edwardmouth | Routine | Phone | 1 | Legitimate
4492 | 4e2838d1-2819-441f-a435-97c22b9f4e8b | d708e793-c837-40e4-8d4c-852a94b4e87f | 3fc3e1bd-a685-4a9e-8014-1d81a6eaffe5 | 8290.29 | 2022-07-10 | tl190 | mO870 | 22 | M | Orthopedics | Denied | 86328.07 | Widowed | Retired | Garcialand | Inpatient | Phone | 0 | Legitimate
4493 | 8a2ffe23-6145-49b3-89a4-1013c4e858e2 | 516ea776-3d69-4c5f-a9d2-bae1698b28d2 | 2fcfedff-0300-4c6d-82ba-cb4858c7487f | 9102.27 | 2022-07-09 | ZQ868 | GC202 | 0 | M | Orthopedics | Approved | 48185.86 | Married | Student | South Anthonyfurt | Emergency | Phone | 1 | Fraud
4494 | 4c4e4abc-e65d-485e-9882-c44485e63917 | f3697794-b8d7-4c0d-a18c-72e5cab95d95 | 98f91962-bcf3-482b-8ea7-f003d74c86ae | 1189.51 | 2022-07-09 | Ux531 | bJ956 | 68 | F | Pediatrics | Denied | 108225.81 | Married | Employed | Herreraborough | Routine | Paper | 0 | Legitimate
4495 | 6c427360-20ae-43b8-802f-bd25fae3ce09 | c0ddd919-1b16-4689-9963-7566ba410835 | c0039b67-ace3-4f97-a646-4214419f9fdf | 3041.50 | 2022-07-09 | qJ110 | bn806 | 10 | M | General Practice | Denied | 80395.76 | Widowed | Student | New Melissastad | Emergency | Paper | 3 | Legitimate
4496 | 43b72c25-94ae-4f1f-a2fb-cb3297978674 | 02ea4377-cf98-4251-a1d6-8eb720d903d8 | 2dcbfa56-e73a-42b5-bfbf-02bfb9b3f990 | 5153.28 | 2022-07-09 | dc670 | wX329 | 96 | F | Neurology | Pending | 31560.84 | Widowed | Retired | Lake Cathymouth | Outpatient | Phone | 1 | Legitimate
4497 | e0bf8e55-7440-48bb-9583-187ab12a5682 | 14844cfb-2bff-4be5-8540-7d58c72ed309 | ae3fdf78-c574-495a-ba8a-2246ba1d61a5 | 6908.45 | 2022-07-09 | cF152 | aT402 | 97 | F | Pediatrics | Denied | 74973.94 | Married | Unemployed | Garyborough | Inpatient | Online | 3 | Legitimate
4498 | 1a3f947a-f3a7-4286-8925-aed2eced6ee2 | cfedbf0b-43eb-4dbe-a26b-74bd566898c8 | d344683d-f2e2-4262-8c04-f9e92fda1d33 | 5830.19 | 2022-07-09 | Sc398 | wv342 | 14 | F | General Practice | Approved | 147665.80 | Widowed | Student | East Claudiafurt | Routine | Paper | 2 | Legitimate
4499 | 291cfa64-9956-40e7-b89f-4628650f42f0 | 2bd2d173-4ce1-428d-836c-259d9236a839 | cf84cf99-0ac3-465a-af90-239a873bafa5 | 5848.92 | 2022-07-09 | TQ972 | Sn273 | 47 | M | Pediatrics | Approved | 131676.02 | Married | Unemployed | North Amberborough | Inpatient | Phone | 2 | Legitimate